What Should I Link to? Identifying Relevant Sources and Classes for Data Linking

Authors

  • Andriy Nikolov
  • Mathieu d'Aquin
  • Enrico Motta
Abstract

With more data repositories constantly being published on the Web, choosing appropriate data sources to interlink with newly published datasets becomes a non-trivial problem. It is necessary to choose both the repositories to link to and the relevant subsets of these repositories, which contain potentially matching individuals. To do this, detailed information about the content and structure of semantic repositories is often required. However, retrieving and processing such information for a potentially large number of datasets is infeasible in practice. In this paper, we propose an approach which uses an existing semantic web index to identify potentially relevant datasets for interlinking and to rank them. Furthermore, we adapt instance-based ontology schema matching to extract relevant subsets of the selected data sources and, in this way, pre-configure data linking tools.
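The two steps named in the abstract (ranking candidate sources through an index, then finding the relevant classes inside them from instance evidence) can be illustrated with a minimal sketch. This is not the authors' algorithm: the `search` callback, the toy index, the source names, and the simple "count labels with at least one hit" scoring are all assumptions made for illustration only.

```python
from collections import Counter, defaultdict

def rank_sources(sample_labels, search):
    """Rank sources by how many sampled instance labels they appear to contain.

    `search` is a hypothetical lookup: label -> iterable of (source, class) hits,
    standing in for a keyword query against a semantic web index.
    """
    source_hits = Counter()            # source -> number of labels with a hit
    class_hits = defaultdict(Counter)  # source -> class -> number of labels with a hit
    for label in sample_labels:
        hits = set(search(label))      # deduplicate repeated hits for one label
        for source in {s for s, _ in hits}:
            source_hits[source] += 1
        for source, cls in hits:
            class_hits[source][cls] += 1
    # Sort by descending hit count, breaking ties by name for determinism.
    ranking = sorted(source_hits, key=lambda s: (-source_hits[s], s))
    return ranking, class_hits

# Toy stand-in for an index (entirely made up).
TOY_INDEX = {
    "Paris": [("geo.example", "City"), ("films.example", "Film")],
    "Berlin": [("geo.example", "City")],
    "Vienna": [("geo.example", "City"), ("music.example", "Composition")],
}

def toy_search(label):
    return TOY_INDEX.get(label, [])

ranking, class_hits = rank_sources(["Paris", "Berlin", "Vienna"], toy_search)
best_class = class_hits[ranking[0]].most_common(1)[0][0]
print(ranking[0], best_class)
```

In this sketch, the top-ranked source and its most frequently hit class would be the candidates handed to a data linking tool as its target dataset and class restriction.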

Similar resources

The Co-Constitution of Health Systems and Innovation; Comment on “What Health System Challenges Should Responsible Innovation in Health Address? Insights From an International Scoping Review”

Lehoux et al provide a timely and relevant turn on the broad and ongoing discussion around the introduction of health technology and innovation. More specifically, the authors suggest a demand-driven approach to health innovation that starts from identifying challenges and demands at the health system level. In this commentary, I review a number of underlying implications of their study in rela...

Identifying Relevant Sources for Data Linking using a Semantic Web Index

With more data repositories constantly being published on the Web, choosing appropriate data sources to interlink with newly published datasets becomes a non-trivial problem. While catalogs of data repositories and meta-level descriptors such as VoiD provide valuable information to take these decisions, more detailed information about the instances included into repositories is often required t...

Identifying natural gas loss risks and ranking of corrective actions

The aim of this study was to provide a new model for identifying the sources of waste gas in Mahdishahr city gas department and to define corrective measures and prioritize measures to help managers to make appropriate decisions to reduce waste gas. The research method is descriptive-analytical in terms of nature and is applied in terms of purpose. The statistical sample of the rese...

Integrating the Population Perspective into Health System Performance Assessment (IPHA): Study Protocol for a Cross-Sectional Study in Germany Linking Survey and Claims Data of Statutorily and Privately Insured

Background Health system performance assessment (HSPA) is a major tool for evidence-based governance in health systems and patient/population-orientation is increasingly considered as an important aspect. The IPHA study aims (1) to undertake a comprehensive performance assessment of the German health system from a population perspec...

I-14: Assisted Reproduction and Religion What Consideration Should Be and Should Not Be Relevant

Several important issues are pertinent to an ethical discussion of the new technology today grouped under the name Assisted Reproduction Technology (ART). This is a moral imperative because each of us must decide what is to be considered ethically acceptable in assisted reproduction. Reflecting on the ethical dimension of ‘creating’ a new human life can play a major role in ensuring that in eac...

Publication date: 2011